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Abstract — The problem of communication and state estimation 
is considered in the context of channels with action-dependent 
states. Given the message to be communicated, the transmitter 
chooses an action sequence that affects the formation of the 
channel states, and then creates the channel input sequence based 
on the state sequence. The decoder estimates the channel to 
some distortion as well as decodes the message. The capacity- 
distortion tradeoff of such a channel is characterized for the 
case when the state information is available strictly causally 
at the channel encoder. The problem setting extends the action 
dependent framework of |1| and as a special case recovers the 
results of few previously considered joint communication and 
estimation scenarios in |2|, |3|, |4|. The scenario when the action 
is also allowed to depend on the past observed states (adaptive 
action) is also considered. It is shown that such adaptive action 
yields an improved capacity-distortion function. 

I. Introduction 

Consider the example scenario of an autonomous under- 
water vehicle engaged in a classification task communicating 
with a surface station; in particular the vehicle employs active 
classification wherein it controls the views it has of the target 
(or state, S). The vehicle can modify its position, sensor 
parameters, etc. One can envision that the vehicle would 
modify its plan as it collects new information about the target 
state [5]. This scenario motivates our examination of both 
adaptive and non-adaptive active communication over channels 
with state. In particular, we are interested in scenarios where 
the encoder can select actions (dependent on the message to 
be sent) that are potentially dependent on the past channel 
states in order to communicate the state as well as additional 
information to the destination. 

In this framework, encoding is in two parts: given the 
message, an action sequence is created. The actions affect 
the formation of the channel states, which are accessible to 
the transmitter in a strictly causal manner when producing 
the channel input sequence. A channel with action-dependent 
states then is characterized by two ingredients: the distribution 
of state given an action p(s\a) and, the distribution of the 
channel output given the input and state p(y\x, s). We are 
interested in the scenario when in addition to communicating 
pure information across the channel, the transmitter also 
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wishes to help reveal the channel state to the receiver. We 
characterize the tradeoff between the independent information 
rate and the accuracy of estimation of the channel state via the 
capacity-distortion function (first introduced in [2|). The wide 
applicability of our framework can be seen in the following 
problems which can be expressed as a problem of conveying 
action dependent state to the destination: active classification 
1 6 1, underwater path planning Q, 0, data storage over mem- 
ory with defects JHJ, dynamic spectrum access systems 
ifTDI etc.. 

Pure communication over channels with action-dependent 
states was introduced in [1| wherein the capacity of such a 
channel with both non-causal and causal state information at 
the encoder were characterized. Due to our goal of acquiring 
the channel at the destination as well as information transmis- 
sion, a distinctly different approach is taken herein relative 
to the Gelfand-Pinsker methodology adopted in [ 1 1 . However, 
we are able to recover the results of HI for the causal case 
(extending the same proof strategy as in strictly causal case), 
revealing an alternative proof strategy. We observe that the 
codes which are optimal for achieving capacity may not be 
good codes for state estimation. 

Alternatively, our work extends that of joint communication 
and state estimation in ifTTl . 0, (3), [|4]; conditioned on the 
action sequence, we have such a problem. The role of the 
action sequence is to not only communicate the message, but 
to also determine a good communication channel for both the 
message as well as the state estimation. The contributions of 
our work is as follows: we characterize the capacity distortion 
function for this problem via a two stage encoding scheme. In 
stage one, information is encoded in the action sequence; in 
stage two, conditioned on the action sequence, a block Markov 
strategy akin to that in [3| is shown to be capacity-distortion 
optimal. We show that strictly causal CSI improves the channel 
estimate, while the capacity is unchanged. Our results are 
generalized to the case where the action sequence is a function 
of both the message and the past channel states (feedback), we 
denote this as adaptive action; the benefits of such an encoding 
are quantified. In addition to the generalizations previously 
mentioned (i.e. 0, 0, 0, HI), we show that our adaptive 
action framework recovers prior results on multiple access 
channels with states H21, ifHl 

The rest of this paper is organized as follows. Section II 
describes the basic channel model with discrete alphabets, 
characterizes the capacity-distortion function, establishes its 
achievability and proves the converse part of the theorem. 
Section III extends the results to the adaptive action setting, 
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Fig. 1. Strictly causal action dependent state communication. 

wherein we allow the feedback from the past states to the 
action encoder. Section IV illustrates our results with few 
examples. Finally, Section V concludes the paper. 

II. Problem Setup and Main Result 

We assume a discrete memoryless channel (DMC) 
with discrete memoryless state (DMS) model [X x S x 
A,p(y\x, s)p(s\a),y) that consists of a finite input alphabet 
X, a finite output alphabet y, a finite state alphabet S, a 
finite action alphabet A and a collection of conditional pmfs 
p(y\x, s) on y. The channel is memoryless in the sense that, 
without feedback, p(y n \x n , s n ) = Yli=i Py\x,s{vMu s i)> and 
given the action sequence, the state is memoryless in the sense 
that (Si, S2, • • •) are independent and identically distributed 
(i.i.d.) with Si ~ ps(si\ai). 

A (2 nR ,n) code for strictly causal action dependent state 
communication consists of 

• a message set [1 : 2 nR ], 

• an action encoder that assigns an action sequence 
a n (m) G A n to each message m G [1 : 2 nR ] 

• a channel encoder that assigns a symbol Xi(m, s 1 ^ 1 ) G 
X to each message m € [1 : 2 nR ] and past state sequence 

gi-l g £4-1 for j £ [1 ■ ri ] > anc J 

• a decoder that assigns a message estimate rh € [1 : 2 nR ] 
(or an error message e) and a state sequence estimate 
~n g gn t0 eac j 1 rece i ve( j sequence y n 6 3^"- 

We assume that A/ is uniformly distributed over the message 
set. The average probability of error is defined as Pe = 
P{M 7^ M}. The fidelity of the state estimate is measured by 
the expected distortion 

n 

E(d(S n ,S n )) = -Y,E(d(Si,S i )), 

i=l 

where d : S x S — >• [0, 00) is a distortion measure between 
a state symbol s G S and a reconstruction symbol s G S. 
Without loss of generality, we assume that for every symbol 
s G S there exists a reconstruction symbol s G S such 
that d(s, s) = 0. A rate-distortion pair is said to be achiev- 
able if there exists a sequence of (2 nR , n) codes such that 
lim^oo F e (n) = and limsup^^ E d(S n , S n ) < D. The 
capacity-distortion function Cg C (D) is defined as in (2J and 
is the supremum of the rates R such that (R, D) is achievable. 

We characterize this optimal tradeoff between information 
transmission rate (capacity C) and state estimation (distortion 
D) as follows. 



Theorem 1: The capacity-distortion function for strictly 
causal action dependent state communication is 

Ci c (D) = max(/([/, A, X; Y) ~ I(U, X;S\A)) , 

where the maximum is over all conditional pmfs 
p(a)p(x\a)p(u\x, s, a) and function s(u, x, a, y) such 
that E(d(S, S)) < D and I(U, X; Y\A) - I(U, X- S\A) > 0. 

Remark 1: We observe that Cg C (D) remains unchanged for 
seemingly for general channels of the form p(y\s,x,a). This 
fact can be shown directly by defining a new state S' = (S, A) 
and applying the above characterization. 

Remark 2: When both the sender and the receiver is obliv- 
ious of the channel state, the capacity-distortion function for 
action dependent state communication can be obtained by 
choosing U = and is given by, 

C A (D) = maxI(X,A;Y), 

where the maximum is over all conditional pmfs p(a)p(x) and 
function s(x,a,y) such that E(d(S, S)) < D. 

Before proving the Theorem [T] we recall a lemma from [3] 
and summarize a few useful properties of Cg C (D) (similar to 
the 15] Corollary 1],J2)), which will be useful in proving the 
converse. 

Lemma 1: Suppose Z — > V — > W form a Markov chain 
and d(z, z) is a distortion measure. Then for every reconstruc- 
tion function z(v, w), there exists a reconstruction function 
z*(v) such that 

E[d(Z,z*(V))] < E[d(Z,z(V,W))]. 

This lemma traces back to Blackwell's notion of channel 
ordering [14|, [15| and can be interpreted as a data processing 
inequality for estimation. 

Corollary 1: The capacity-distortion function Cg C (D) in 
Theorem [T] has the following properties: 

(1) Cgc(D) is a non-decreasing concave function of D for all 
D> D*, 

(2) Cg C (D) is a continuous function of D for all D > D* , 

(3) C£ C (D*) = if D* ^ and C SC (L>*) > if D* = 0, 
where D* is the minimum distortion with strictly causal 
channel state at the sender akin to the zero rate case in Q. 

A. Sketch of Achievability: 

We use b transmission blocks, each consisting of n symbols. 
The channel encoder uses a rate-splitting technique, whereby 
in block j, it appropriately allocates it's rate between cooper- 
ative transmission of common message rrij and a description 
of the state sequence S n (j — 1) in block j — 1. Typical sets 
are defined as in |fl6ll . 

Codebook generation. Fix a conditional pmf 

p(a)p(x\a)p(u\x, s, a) and function s(u,x,y,a) that attain 
Cg C (D/(l + e)), where D is the desired distortion, and 
let p(u\x, a) = J2 s p(s\a)p(u\x,s,a). For each j G [1 : b], 
randomly and independently generate 2 nR sequences a n (mj), 
rrij G [1 : 2 nR ], each according to n"=i-P^( a i) and f° r 
each a n (nij), generate 2 nRs sequences x n (mj,lj_i), 



vrtj £ [1 : 2 nR ),l j _ 1 £ [1 : 2 nRs ], each according to 
U2=iPx(xi\ai)- For each m j e [1 : 2" i? ],/,_ 1 g [1 : 2**f], 
randomly and conditionally independently generate 2 nRs 
sequences u n (kj\mj,lj-i), kj £ [1 : 2 nRs ), each according 
t0 Il™ = iPu\x,A( u i\ x i( m j' l j-i)' a i( m j))- Partition the 
set of indices kj £ [1 : 2 nRs ) into equal-size bins 
B{lj) = [(Zj— l)2 n ^ s - fls )+l : ^-2"^-^)], G [1 : 2 ni?s ]. 
The codebook is revealed to the both encoder and the decoder. 

Encoding. By convention, let Iq = 1. At the end of block j, 
the sender finds an index kj such that 

(s n (j),u n (k j \m j ,l j _ 1 ),x n (m j ,l j _ 1 ),a n (m j )) G tJ" } . 

If there is more than one such index, it selects one of them 
uniformly at random. If there is no such index, it selects an 
index from [1 : 2 nRs ] uniformly at random. In block j + 1, the 
action encoder chooses the action sequence a n (m,j+i), where 
rrij+i is the new message index to be sent in block j + 1. Let 
s n (j + 1) be the channel state sequence generated in response 
to the action sequence. The channel encoder then transmits 
x n (rrij+i, lj) over the state dependent channel in block j + 1, 
where lj is the bin index of kj. 

Decoding. Let e > e'. At the end of block j + 1, 
the receiver finds the unique index m J+1 ,^ such that 
(x n (rhj +u lj),y n (j + l),a"(m i+1 )) e Te {n) ■ It then looks 
for the unique compression index kj £ B(l j) such that 

^u n (kj\mjjj-i),x n (mjjj_ 1 ),a n (m j ),y n (j)) G Te n) and 
kj G B(lj). Finally it computes the reconstruction sequence 

as Si(j) = s(ui(kj\mj,lj-i),Xi(mjJj-i),ai(mj),yi(j)) for 
i £ [1 : n]. 

Note that the achievablity scheme resembles the one in (3), 
as in conditioned on the action sequence a"(m), we use 
a similar block Markov strategy to convey the state to the 
decoder. So essentially the action sequence adds one more 
degrees of freedom to the framework of 

B. Proof of the Converse 

We need to show that given any sequence of (2 nR , n)-codes 
with lim„^ 00 P e ( ™ ) = and E(d{S n ,S n )) < D, we must 
have R < C§ C (D). We identify the auxiliary random variables 
Ui := (M, S i ~ 1 ,Yfi_ 1 ,A n \ i ), i G [1 : n] with n\i = [1 : 
n] - i and (S , Y n+ i) = (0, 0, 0). Note that, as desired, U l ->• 
(Xi, Si) — > Yi form a Markov chain. Consider 

nR = H(M) 

<I(M;Y n )+ne n 

= J £l(M;Y i \Y^ +1 ) + ne n 

i=l 

n 

<^/(A/,y^ 1 ;y l ) + ^« 

i=l 

71 

= ' s ^2 l (I(M,Yfi i _ 1 ,S i ~ 1 ;Yi) — I(S i ~ 1 ;Y i \M,Y£_ 1 )) 

i=l 
+ ne n 



— 53 Y^-y , S" 1-1 , A n ; Yi) 

71 

- 53 7(1^!; Si | M, S l -\A n )+ne n 

i=l 
n 

— 53 ^(M, Y^ lt S i ~ x ,A n ;Y i ) 

1=1 

n 

- 5] J(M, 5*- 1 , 1^, Si\Ai) + ne n 

i=l 

71 

( = } ^2{I(U U X u Ai- Yi) - I(Ui, X,- Si\Ai)) + ne n , 
»=i 

where (a) can be shown by Fano's inequality (see [17 The- 
orem 7.7.1]), (b) follows from the Csisiar sum identity lfl8l 
Sec. 2.3] and since A n is a function of M, (c) follows from 
the fact that given Ai, (M, S 1 ' 1 , A n \*) is independent of Si, 
and (d) is true as Xi is a function of (M, S' 1 ^ 1 ). Similarly, for 
this choice of Ui, 

n n 

Y,I(Ui,X i ;S i \A i ) = Y,I(M,S i - 1 , YP +1 , A n \* ,Xi\ Si \Ai) 

i=l i=l 
n 

= J £l(Y£ 1 ;S i \M,S i -\A n ) 

i=l 
71 

^^IiS^^M^^A") 

i=l 

71 

K^IiM^^^A^YilAi) 

i=l 

71 

= ^I(Ui,Xi;Yi\Ai). 

i=l 

So now we have 

^ n n 

R<-J2 I (Ui,Xi,Ai;Yi)-Y,I(U i ,Xi ] Si\Ai) + ne n 

i=l i=l 
( a ) 1 U 

< - 53 c sc (E(d(5i , *<(D*, ^, ^, F,)))) + ne„ 

i=l 

< (i ^ E(d(5i, h(U h X h Ai,Yi)))) + ne n 

i=i 

<Cg c (D), 

where (a) follows from the definition of capacity-distortion 
function, (b) follows by the concavity of C^ C (D) (see Property 
1 of Corollary [TJ, and (c) can be shown using Lemma [T] and 
Corollary [T] This completes the proof of Theorem [Tj Note that 
main difficulty of the converse is to identify U, which not only 
has to satisfy the rate and distortion condition (as in [4|), but 
also need to satisfy the additional information inequality. 

III. Adaptive Action 
It is natural to wonder whether "feedback" from the past 
states at the action stage (ai(m, s !_1 )) increases the capacity- 
distortion function or not. For an extreme example, consider 



a channel for which p(y\s, x, a) = p(y\s,a). Clearly, the 
capacity-distortion function for any such channel with only 
message dependent non-adaptive action (a™(m)) is same as 
that of no CSI, since the action encoder is oblivious of the 
channel state. But with adaptive action, the action encoder 
can perform a block Markov strategy to yield a potentially 
larger capacity-distortion function, which is summarized be- 
low without proof. 

Theorem 2: The capacity-distortion function for strictly 
causal adaptive action dependent state communication is 

C AA (D) = max(/(C/, A, X; Y) - I(U, X, A; S)) , 

where the maximum is over all conditional pmfs 
p(a)p(x\a)p(u\x, s, a) and function s(u,x,a,y) such 
that E(d(S, S)) < D. 

Note that the unconstrained capacity remains unchanged 
even if we allow the actions to depend on the past states. In 
general, C^ A (D) > Cg C (D) as the adaptive action helps the 
receiver to get a better estimate of the state. Finally, by setting 
A = in Theorem [5J we recover the result by E| on the 
capacity-distortion function when the i.i.d. state information 
is available strictly causally at the encoder. 

Remark 3: When the past states are available at both the 
encoders, the encoders cooperate to send information consist- 
ing of the common message and a description of the state 
in previous block (similar to sending a common message 
over multiple access channel (MAC)), whereas in the non- 
adaptive action scenario, while the common message is sent 
cooperatively, description of the state is a private message of 
the channel encoder. 

IV. Illustrative Examples 

In the following subsections, we illustrate Theorem [T] and 
Theorem [2] through examples. 

A. Actions Seen by Decoder: 

Consider the case where the decoder also has access to the 
actions taken. Noting that this is a special case of our setting by 
taking the pair (Y, A) as the new channel output, that U — > 
(X,S,A) -> Y if and only if U (X,S,A) -> (Y, A). 
We obtain that the capacity-distortion function for the case of 
message depepdent action is given by 

Ci c (D) = max(H(A) +I(U, X;Y | A) -I(U,X; S \ A)), 

where the maximization is over the same set of distributions 
and same feasible set as in Theorem [T] Similarly we can eval- 
uate the capacity-distortion function for the case of adaptive 
actions. This expression is quite intuitive: The amount of infor- 
mation per symbol that can be conveyed through the actions in 
the first stage is represented by the term H(A). In the second 
stage, both encoder and decoder know the action sequence, 
so they can condition on it and can perform the usual block 
Markov strategy on each subsequence associated with each ac- 
tion symbol, achieving a rate of 1(17, X; Y\A) — I(U, X; S\A). 
The maximization is a search for the optimal tradeoff between 
the amount of information that can be conveyed by the actions, 
and the quality of the second stage channel that they induce. 
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Fig. 2. Capacity-distortion function: adaptive vs. non-adaptive 

B. Gaussian Channel with Additive Action Dependent State 

Consider the Gaussian channel with additive action depen- 
dent state |Q] 

Y = X + S + Z = X + A + S + Z, 

where S ~ N(0, Q) and the noise Z ~ N(0, N) are indepen- 
dent. Assume an expected average power constraint on both 
the channel and action encoder 

n n 

E(z?(m, S*- 1 )) < nP x ^ E ( a f) < nP A . 

t=i i=i 

We consider the squared error (quadratic) distortion measure 
d(s, s) — (s — s) 2 . When the action sequnce is only a function 
of the message, using Theorem [T] we have the following. 

Proposition 1: The capacity-distortion function of the 
Gaussian channel with message dependent action is 





|log( 
1 C ( 



,QN/D 



0<D< Di in , 



Dt 



< D <D n 



where C(x) 



Q+N 

= log(l ~ 
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D > D r , 
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'Pa(Px (Q + N)))- 

When we allow the action encoder to observe the past 
states (adaptive action), the capacity-distortion follows from 
Theorem[2]and it has the similar form of Proposition[T] but P A 
and -D^ in are replaced by P AA and D^f^ respectively, where 
P AA = P x +Q+N+P A +2^PJPx~ and D AA n = QN/P AA . 

The proof of the proposition is omitted here for brevity. 
Note that since P AA > P A , the capacity-distortion function 



is larger in the adaptive action scenario (see Figure IV-B I. In 
fact, the minimum distortion achievable with adaptive action is 
smaller than that of non-adaptive action. But the unconstrained 
capacity (capacity-distion function for D > D max ) is same 
in both the cases, which implies that adaptive action in useful 
in estimation rather than in information transmission. Finally 
by substituting P A = 0, both the capacity-distortion functions 
reduces to that in 0. 
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Fig. 3. State dependent MAC with strictly causal CSI at both encoders. 

C. State dependent MAC 

Consider communicating a common message over a mem- 



oryless state-dependent MAC (see Figure IV-B I characterized 
by p(y\s,Xi,x 2 ), where the state sequence is known strictly- 
causally to both encoders. This problem can be seen as a 
special case of our adaptive action setting via the following 
associations: 

A = X 2 ,X = Xi,p(s\a) = p(s),p(y\s,a,x) = p(s,xi,x 2 ). 

Applying Theorem [2] to this case, keeping in mind the Re- 
mark [T] following the statement of the Theorem [T] regarding 
channels of the form p(y\s, x, a), we get that the capacity- 
distortion function is given by 

Cf c (D) = max(/(f/, X 2 , X 1 ; Y) - I(U, X i; S\X 2 )), 

where the maximum is over p(xi, x 2 )p(u\xx, s, x 2 ) and func- 
tion s(u, x-y, x 2 , y) such that E(d(S, S)) < D. This setting 
was considered in fT2"l . |fT~3Tl and it recovers the common 
message capacity results of [12|, |13|. One can also consider 
a scenario where the state sequence is known strictly-causally 
to the first encoder, but unknown at the second encoder and 
at the receiver. This problem, motivated by multiterminal 
communication scenarios involving transmitters with different 
degrees of channel state information, is a special case of 
Theorem [T] We can show that the capacity-distortion func- 
tion (Csc(D)) is the same as C§ C (D) with the additonal 
constraint of I(U,Xn Y\X 2 ) - I(U, Xi; S\X 2 ) > on the 
feasible distributions. Clearly C§ C (D) > C^f (D), since with 
symmetric channel state information, the encoders can jointly 
perform both message and state cooperation as opposed to only 
message cooperation when the state information is available 
at only one of the encoders. 

V. Conclusions 

Motivated by an active classification problem with au- 
tonomous vehicles, we combine the frameworks of [ 1 1 and 
0, to examine the problem wherein the formation of channel 
states is affected by actions taken at the encoder; further, 
the decoder has the two simultaneous goals of estimating the 
channel state up to some distortion and simultaneously de- 
coding the transmitted message. We characterize the capacity- 
distortion function for this problem where the channel states 



are known strictly causally at (a) only the channel encoder, and 
(b) both the action encoder and channel encoder. By realizing 
that, conditioned on the action sequence, our framework is 
similar to that in Q, we have shown that a two stage 
encoding strategy is optimal. In the first stage, the action 
is communicated and in the second stage, conditioned on 
the action sequence, a block Markov strategy is performed 
to utilize the strictly causal CSI at the encoder(s). We have 
also shown that the state-dependent MAC with symmetric 
and asymmetric state information is a special case of our 
framework and thus are able to use our results to recover the 
common message capacity results of the MAC with strictly 
causal CSI (see flj), El). 
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